Algebraic number field

In mathematics, an algebraic number field (or simply number field) F is a finite (and hence algebraic) field extension of the field of rational numbers Q. Thus F is a field that contains Q and has finite dimension when considered as a vector space over Q.

The study of algebraic number fields, and, more generally, of algebraic extensions of the field of rational numbers, is the central topic of algebraic number theory.

Contents

Definition

Prerequisites

The notion of algebraic number field relies on the concept of a field. Fields consists of a set of elements together with four operations, namely addition, subtraction, multiplication and division by nonzero elements. A prominent example of a field is the field of rational numbers, commonly denoted Q, together with its usual operations of addition etc.

Another notion needed to define algebraic number fields is vector spaces. To the extent needed here, vector spaces can be thought of as consisting of sequences (or tuples)

(x1, x2, ...)

whose entries are elements of a fixed field, such as the field Q. Any two such sequences can be added by adding the entries one per one. In addition, any sequence can be multiplied by a single element c of the fixed field. These two operations known as vector addition and scalar multiplication satisfy a number of properties that serve to define vector spaces abstractly. Vector spaces are allowed to be "infinite-dimensional", that is to say that the sequences constituting the vector spaces are of infinite length. If, however, the vector space consists of finite sequences

(x1, x2, ..., xn),

the vector space is said to be of finite dimension, n.

Definition

An algebraic number field (or simply number field) is by definition a finite degree field extension of the field of rational numbers. Its degree as an extension of Q is simply called its degree.

Examples

a+bi
where both a and b are rational numbers and i is the imaginary unit. Such expressions may be added, subtracted, and multiplied according to the usual rules of arithmetic and then simplified using the identity
i2 = −1.
Explicitly,
(a + bi) + (c + di) = (a + c) + (b + d)i,
(a + bi) (c + di) = (acbd) + (ad + bc)i.
Non-zero Gaussian rational numbers are invertible, which can be seen from the identity
(a+bi)\left(\frac{a}{a^2+b^2}-\frac{b}{a^2+b^2}i\right)=\frac{(a+bi)(a-bi)}{a^2+b^2}=1.
It follows that the Gaussian rationals form a number field which is two-dimensional as a vector space over Q.
Q(√d)
is a number field obtained by adjoining the square root of d to the field of rational numbers. Arithmetic operations in this field are defined in analogy with the case of gaussian rational numbers, d = − 1.
Qn), ζn = exp (2πi / n)
is a number field obtained from Q by adjoining a primitive nth root of unity ζn. This field contains all complex nth roots of unity and its dimension over Q is equal to φ(n), where φ is the Euler totient function.
(1, 0) · (0, 1) = (1 · 0, 0 · 1) = (0, 0).

Algebraicity and ring of integers

Generally, in abstract algebra, a field extension F / E is algebraic if every element f of the bigger field F is the zero of a polynomial with coefficients e0, ..., em in E:

p(f) = emfm + em−1fm−1 + ... + e1f + e0 = 0.

It is a fact that every finite field extension is algebraic. In particular this applies to algebraic number fields, so any element f of an algebraic number field F can be written as a zero of a polynomial with rational coefficients. Therefore, elements of F are also referred to as algebraic numbers. Given a polynomial p such that p(f) = 0, it can be arranged such that the leading coefficient em is one, by dividing all coefficients by it, if necessary. A polynomial with this property is known as a monic polynomial. In general it will have rational coefficients. If, however, its coefficients are actually all integers, f is called an algebraic integer. Any (usual) integer zZ is an algebraic integer, as it is the zero of the linear monic polynomial:

p(t) = tz.

It can be shown that any algebraic integer that is also a rational number must actually be an integer, whence the name "algebraic integer". Again using abstract algebra, specifically the notion of a finitely generated module, it can be shown that the sum and the product of any two algebraic integers is still an algebraic integer, it follows that the algebraic integers in F form a ring denoted OF called the ring of integers of F. It is a subring of (that is, a ring contained in) F. A field contains no zero divisors and this property is inherited by any subring. Therefore, the ring of integers of F is an integral domain. The field F is the field of fractions of the integral domain OF. This way one can get back and forth between the algebraic number field F and its ring of integers OF. Rings of algebraic integers have three distinctive properties: firstly, OF is an integral domain that is integrally closed in its field of fractions F. Secondly, OF is a Noetherian ring. Finally, every nonzero prime ideal of OF is maximal or, equivalently, the Krull dimension of this ring is one. An abstract commutative ring with these three properties is called a Dedekind ring (or Dedekind domain), in honor of Richard Dedekind, who undertook a deep study of rings of algebraic integers.

Unique factorization and class number

For general Dedekind rings, in particular rings of integers, there is a unique factorization of ideals into a product of prime ideals. However, unlike Z as the ring of integers of Q, the ring of integers of a proper extension of Q need not admit unique factorization of numbers into a product of prime numbers or, more precisely, prime elements. This happens already for quadratic integers, for example in OQ(√−5) = Z[√−5], the unicity of the factorization fails:

6 = 2 · 3 = (1 + √−5) · (1 − √−5),

–using the norm it can be shown that these two factorization are actually inequivalent in the sense that the factors do not just differ by a unit in OQ(√−5). Euclidean domains are unique factorization domains; for example Z[i], the ring of Gaussian integers, and Z[ω], the ring of Eisenstein integers, where ω is a third root of unity (unequal to 1), have this property.[1]

ζ-functions, L-functions and class number formula

The failure of unique factorization is measured by the class number, commonly denoted h, the cardinality of the so-called ideal class group. This group is always finite. The ring of integers OF possesses unique factorization if and only if it is a principal ring or, equivalently, if F has class number 1. Given a number field, the class number is often difficult to compute. The class number problem, going back to Gauss, is concerned with the existence of imaginary quadratic number fields (i.e., Q(√d), d ≥ 1) with prescribed class number. The class number formula relates h to other fundamental invariants of F. It involves the Dedekind zeta function ζF(s), a function in a complex variable s, defined by

\zeta_F(s)�:= \prod_{\mathfrak{p}} \frac{1}{1-N(\mathfrak{p})^{-s}}.

(The product is over all prime ideals of OF, N(\mathfrak p) denotes the norm of the prime ideal or, equivalently, the (finite) number of elements in the residue field O_F / \mathfrak p. The infinite product converges only for Re(s) > 1, in general analytic continuation and the functional equation for the zeta-function are needed to define the function for all s). The Dedekind zeta-function generalizes the Riemann zeta-function in that ζQ(s) = ζ(s).

The class number formula states that ζF(s) has a simple pole at s = 1 and at this point (its meromorphic continuation to the whole complex plane) the residue is given by

 \frac{2^{r_1}\cdot(2\pi)^{r_2}\cdot h\cdot \operatorname{Reg}}{w \cdot \sqrt{|D|}}.

Here r1 and r2 classically denote the number of real embeddings and pairs of complex embeddings of F, respectively. Moreover, Reg is the regulator of F, w the number of roots of unity in F and D is the discriminant of F.

Dirichlet L-functions L(χ, s) are a more refined variant of ζ(s). Both types of functions encode the arithmetic behavior of Q and F, respectively. For example, Dirichlet's theorem asserts that in any arithmetic progression

a, a + m, a + 2m, ...

with coprime a and m, there are infinitely many prime numbers. This theorem is implied by the fact that the Dirichlet L-function is nonzero at s = 1. Using much more advanced techniques including algebraic K-theory and Tamagawa measures, modern number theory deals with a description, if largely conjectural (see Tamagawa number conjecture), of values of more general L-functions.[2]

Bases for number fields

Integral basis

An integral basis for a number field F of degree n is a set

B = {b1, …, bn}

of n algebraic integers in F such that every element of the ring of integers OF of F can be written uniquely as a Z-linear combination of elements of B; that is, for any x in OF we have

x = m1b1 + … + mnbn,

where the mi are (ordinary) integers. It is then also the case that any element of F can be written uniquely as

m1b1 + … + mnbn,

where now the mi are rational numbers. The algebraic integers of F are then precisely those elements of F where the mi are all integers.

Working locally and using tools such as the Frobenius map, it is always possible to explicitly compute such a basis, and it is now standard for computer algebra systems to have built-in programs to do this.

Power basis

Let F be a number field of degree n. Among all possible bases of F (seen as a Q-vector space), there are particular ones known as power bases, that are bases of the form

Bx = {1, x, x2, ..., xn−1}

for some element xF. By the primitive element theorem, for there exists such an x, called a primitive element. If x can be chosen in OF and such that Bx is a basis of OF as a free Z-module, then Bx is called a power integral basis, and the field F is called a monogenic field. An example of a number field that is not monogenic was first given by Dedekind. His example is the field obtained by adjoining a root of the polynomial x3x2 − 2x − 8.[3]

The regular representation, trace and determinant

Using the multiplication in F, the elements of the field F may be represented by n-by-n matrices

A = A(x)=(aij)1 ≤ i, jn,

by requiring

x e_i = \sum_{j=1}^n a_{ij} e_j, \quad a_{ij}\in\mathbb{Q}.

Here e1, ..., en is a fixed basis for F, viewed as a Q-vector space. The rational numbers aij are uniquely determined by x and the choice of a basis since any element of F can be uniquely represented as a linear combination of the basis elements. This way of associating a matrix to any element of the field F is called the regular representation. The square matrix A represents the effect of multiplication by x in the given basis. It follows that if the element y of F is represented by a matrix B, then the product xy is represented by the matrix product AB. Invariants of matrices, such as the trace, determinant, and characteristic polynomial, depend solely on the field element x and not on the basis. In particular, the trace of the matrix A(x) is called the trace of the field element x and denoted Tr(x), and the determinant is called the norm of x and denoted N(x).

By definition, standard properties of traces and determinants of matrices carry over to Tr and N: Tr(x) is a linear function of x, as expressed by Tr(x + y) = Tr(x) + Tr(y), Tr(λx) = λ Tr(x), and the norm is a multiplicative homogeneous function of degree n: N(xy) = N(x) N(y), N(λx) = λn N(x). Here λ is a rational number, and x, y are any two elements of F.

The trace form derives is a bilinear form defined by means of the trace, as Tr(x y). The integral trace form, an integer-valued symmetric matrix is defined as tij = Tr(bibj), where b1, ..., bn is an integral basis for F. The discriminant of F is defined as det(t). It is an integer, and is an invariant property of the field F, not depending on the choice of integral basis.

The matrix associated to an element x of F can also be used to give other, equivalent descriptions of algebraic integers. An element x of F is an algebraic integer if and only if the characteristic polynomial pA of the matrix A associated to x is a monic polynomial with integer coefficients. Suppose that the matrix A that represents an element x has integer entries in some basis e. By the Cayley-Hamilton theorem, pA(A) = 0, and it follows that pA(x) = 0, so that x is an algebraic integer. Conversely, if x is an element of F which is a root of a monic polynomial with integer coefficients then the same property holds for the corresponding matrix A. In this case it can be proven that A is an integer matrix in a suitable basis of F. Note that the property of being an algebraic integer is defined in a way that is independent of a choice of a basis in F.

Example

Consider F = Q(x), where x satisfies x3 − 11x2 + x + 1 = 0. Then an integral basis is [1, x, 1/2(x2 + 1)], and the corresponding integral trace form is

\begin{bmatrix}
3 & 11 & 61 \\
11 & 119 & 653 \\
61 & 653 & 3589 \\
\end{bmatrix}.

The "3" in the upper left hand corner of this matrix is the trace of the matrix of the map defined by the first basis element (1) in the regular representation of F on F. This basis element induces the identity map on the 3-dimensional vector space, F. The trace of the matrix of the identity map on a 3-dimensional vector space is 3.

The determinant of this is 1304 = 23 163, the field discriminant; in comparison the root discriminant, or discriminant of the polynomial, is 5216 = 25 163.

Places

Mathematicians of the nineteenth century assumed that algebraic numbers were a type of complex number. This situation changed with the discovery of p-adic numbers by Hensel in 1897; and now it is standard to consider all of the various possible embeddings of a number field F into its various topological completions at once.

A place of a number field F is an equivalence class of absolute values on F. Essentially, an absolute value is a notion to measure the size of elements f of F. Two such absolute values are considered equivalent if they give rise to the same notion of smallness (or proximity). In general, they fall into three regimes. Firstly (and mostly irrelevant), the trivial absolute value | |0, which takes the value 1 on all non-zero f in F. The second and third classes are Archimedean places and non-Archimedean (or ultrametric) places. The completion of F with respect to a place is given in both cases by taking Cauchy sequences in F and dividing out null sequences, that is, sequences (xn)nN such that |xn| tends to zero when n tends to infinity. This can be shown to be a field again, the so-called completion of F at the given place.

For F = Q, the following non-trivial norms occur (Ostrowski's theorem): the (usual) absolute value, which gives rise to the complete topological field of the real numbers R. On the other hand, for any prime number p, the p-adic absolute values is defined by

|q|p = pn, where q = pn a/b and a and b are integers not divisible by p.

In contrast to the usual absolute value, the p-adic norm gets smaller when q is multiplied by p, leading to quite different behavior of Qp vis-à-vis R.

Archimedean places

Calculating the archimedean places of F is done as follows: let x be a primitive element of F, with minimal polynomial (over Q) f. Over R, f will generally no longer be irreducible, but its irreducible (real) factors are either of degree one or two. Since there are no repeated roots, there are no repeated factors. The roots r of factors of degree one are necessarily real, and replacing x by r gives an embedding of F into R; the number of such embeddings is equal to the number of real roots of f. Restricting the standard absolute value on R to F gives an archimedean absolute value on F; such an absolute value is also referred to as a real place of F. On the other hand, the roots of factors of degree two are pairs of conjugate complex numbers, which allows for two conjugate embeddings into C. Either one of this pair of embeddings can be used to define an absolute value on F, which is the same for both embeddings since they are conjugate. This absolute value is called a complex place of F.

If all roots of f above are real or, equivalently, any embedding FC is actually inside R, F is called totally real.

Nonarchimedian or ultrametric places

To find the nonarchimedean places, let again f and x be as above. In Qp, f splits in factors of various degrees, none of which are repeated, and the degrees of which add up to n, the degree of f. For each of these p-adically irreducible factors t, we may suppose that x satisfies t and obtain an embedding of F into an algebraic extension of finite degree over Qp. Such a local field behaves in many ways like a number field, and the p-adic numbers may similarly play the role of the rationals; in particular, we can define the norm and trace in exactly the same way, now giving functions mapping to Qp. By using this p-adic norm map Nt for the place t, we may define an absolute value corresponding to a given p-adically irreducible factor t of degree m by |θ|t = |Nt(θ)|p1/m. Such an absolute value is called an ultrametric, non-Archimedean or p-adic place of F.

For any ultrametric place v we have that |x|v ≤ 1 for any x in OF, since the minimal polynomial for x has integer factors, and hence its p-adic factorization has factors in Zp. Consequently, the norm term (constant term) for each factor is a p-adic integer, and one of these is the integer used for defining the absolute value for v.

Prime ideals in OF

For an ultrametric place v, the subset of OF defined by |x|v < 1 is an ideal P of OF. This relies on the ultrametricity of v: given x and y in P, then

|x + y|v ≤ max (|x|v, |y|v) < 1.

Actually, P is even a prime ideal.

Conversely, given a prime ideal P of OF, a discrete valuation can be defined by setting vP(x) = n where n is the biggest integer such that xPn, the n-fold power of the ideal. This valuation can be turned into a ultrametic place. Under this correspondence, (equivalence classes) of ultrametric places of F correspond to prime ideals of OF. For F = Q, this gives back Ostrowski's theorem: any prime ideal in Z (which is necessarily by a single prime number) corresponds to an non-archimedean place and vice versa. However, for more general number fields, the situation becomes more involved, as will be explained below.

Yet another, equivalent way of describing ultrametric places is by means of localizations of OF. Given an ultrametric place v on a number field F, the corresponding localization is the subring T of F of all elements x such that | x |v ≤ 1. By the ultrametric propery T is a ring. Moreover, it contains OF. For every element x of F, at least one of x or x−1 is contained in T. Actually, since F×/T× can be shown to be isomorphic to the integers, T is a discrete valuation ring, in particular a local ring. Actually, T is just the localization of OF at the prime ideal P. Conversely, P is the maximal ideal of T.

Altogether, there is a three-way equivalence between ultrametric absolute values, prime ideals, and localizations on a number field.

Ramification

Schematic depiction of ramification: the fibers of almost all points in Y below consist of three points, except for two points in Y marked with dots, where the fibers consist of one and two points (marked in black), respectively. The map f is said to be ramified in these points of Y.

Ramification, generally speaking, describes a geometric phenomenon that can occur with finite-to-one maps (that is, maps f: XY such that the preimages of all points y in Y consist only of finitely many points): the cardinality of the fibers f−1(y) will generally have the same number of points, but it occurs that, in special points y, this number drops. For example, the map

CC, zzn

has n points in each fiber over t, namely the n (complex) roots of t, except in t = 0, where the fiber consists of only one element, z = 0. One says that the map is "ramified" in zero. This is an example of a branched covering of Riemann surfaces. This intuition also serves to define ramification in algebraic number theory. Given a (necessarily finite) extension of number fields F / E, a prime ideal p of OE generates the ideal pOF of OF. This ideal may or may not be a prime ideal, but, according to the Lasker-Noether theorem (see above), always is given by

pOF = q1e1 q2e2 ... qmem

with uniquely determined prime ideals qi of OF and numbers (called ramification indices) ei. Whenever one ramification index is bigger than one, the prime p is said to ramify in F.

The connection between this definition and the geometric situation is delivered by the map of spectra of rings Spec OF → Spec OE. In fact, unramified morphisms of schemes in algebraic geometry are a direct generalization of unramified extensions of number fields.

Ramification is a purely local property, i.e., depends only on the completions around the primes p and qi. The inertia group measures the difference between the local Galois groups at some place and the Galois groups of the involved finite residue fields.

An example

The following example illustrates the notions introduced above. In order to compute the ramification index of Q(x), where

f(x) = x3x − 1 = 0,

at 23, it suffices to consider the field extension Q23(x) / Q23. Up to 529 = 232 (i.e., modulo 529) f can be factored as

f(x) = (x + 181)(x2 − 181x − 38) = gh.

Substituting x = y + 10 in the first factor g modulo 529 yields y + 191, so the valuation | y |g for y given by g is | −191 |23 = 1. On the other hand the same substitution in h yields y2 − 161y − 161 modulo 529. Since 161 = 7 × 23,

|y|h = √∣161∣23 = 1 / √23.

Since possible values for the absolute value of the place defined by the factor h are not confined to integer powers of 23, but instead are integer powers of the square root of 23, the ramification index of the field extension at 23 is two.

The valuations of any element of F can be computed in this way using resultants. If, for example y = x2x − 1, using the resultant to eliminate x between this relationship and f = x3x − 1 = 0 gives y3 − 5y2 + 4y − 1 = 0. If instead we eliminate with respect to the factors g and h of f, we obtain the corresponding factors for the polynomial for y, and then the 23-adic valuation applied to the constant (norm) term allows us to compute the valuations of y for g and h (which are both 1 in this instance.)

Dedekind discriminant theorem

Much of the significance of the discriminant lies in the fact that ramified ultrametric places are all places obtained from factorizations in Qp where p divides the discriminant. This is even true of the polynomial discriminant; however the converse is also true, that if a prime p divides the discriminant, then there is a p-place which ramifies. For this converse the field discriminant is needed. This is the Dedekind discriminant theorem. In the example above, the discriminant of the number field Q(x) with x3 − x − 1 = 0 is −23, and as we have seen the 23-adic place ramifies. The Dedekind discriminant tells us it is the only ultrametric place which does. The other ramified place comes from the absolute value on the complex embedding of F.

Galois groups and Galois cohomology

Generally in abstract algebra, field extensions F / E can be studied by examining the Galois group Gal(F / E), consisting of field automorphisms of F leaving E elementwise fixed. As an example, the Galois group Gal (Qn) / Q) of the cyclotomic field extension of degree n (see above) is given by (Z/nZ)×, the group of invertible elements in Z/nZ. This is the first stepstone into Iwasawa theory.

In order to include all possible extensions having certain properties, the Galois group concept is commonly applied to the (infinite) field extension F / F of the algebraic closure, leading to the absolute Galois group G := Gal(F / F) or just Gal(F), and to the extension F / Q. The fundamental theorem of Galois theory links fields in between F and its algebraic closure and closed subgroups of Gal (F). For example, the abelianization (the biggest abelian quotient) Gab of G corresponds to a field referred to as the maximal abelian extension Fab (called so since any further extension is not abelian, i.e., does not have an abelian Galois group). By the Kronecker–Weber theorem, the maximal abelian extension of Q is the extension generated by all roots of unity. For more general number fields, class field theory, specifically the Artin reciprocity law gives an answer by describing Gab in terms of the idele class group. Also notable is the Hilbert class field, the maximal abelian unramified field extension of F. It can be shown to be finite over F, its Galois group over F is isomorphic to the class group of F, in particular its degree equals the class number h of F (see above).

In certain situations, the Galois group acts on other mathematical objects, for example a group. Such a group is then also referred to as a Galois module. This enables the use of group cohomology for the Galois group Gal(F), also known as Galois cohomology, which in the first place measures the failure of exactness of taking Gal(F)-invariants, but offers deeper insights (and questions) as well. For example, the Galois group G of a field extension L / F acts on L×, the nonzero elements of L. This Galois module plays a significant role in many arithmetic dualities, such as Poitou-Tate duality. The Brauer group of F, originally conceived to classify division algebras over F, can be recast as a cohomology group, namely H2(Gal (F), F×).

Local-global principle

Generally speaking, the term "local to global" refers to the idea that a global problem is first done at a local level, which tends to simplify the questions. Then, of course, the information gained in the local analysis has to be put together to get back to some global statement. For example, the notion of sheaves reifies that idea in topology and geometry.

Local and global fields

Number fields share a great deal of similarity with another class of fields much used in algebraic geometry known as function fields of algebraic curves over finite fields. An example is Fp(T). They are similar in many respects, for example in that number rings are one-dimensional regular rings, as are the coordinate rings (the quotient fields of which is the function field in question) of curves. Therefore, both types of field are called global fields. In accordance with the philosophy laid out above, they can be studied at a local level first, that is to say, by looking at the corresponding local fields. For number fields F, the local fields are the completions of F at all places, including the archimedean ones (see local analysis). For function fields, the local fields are completions of the local rings at all points of the curve for function fields.

Many results valid for function fields also hold, at least if reformulated properly, for number fields. However, the study of number fields often poses difficulties and phenomena not encountered in function fields. For example, in function fields, there is no dichotomy into non-archimedean and archimedean places. Nonetheless, function fields often serves as a source of intuition what should be expected in the number field case.

The Hasse principle

A prototypical question, posed at a global level, is whether some polynomial equation has a solution in F. If this is the case, this solution is also a solution in all completions. The local-global principle or Hasse principle asserts that for quadratic equations, the converse holds, as well. Thereby, checking whether such an equation has a solution can be done on all the completions of F, which is often easier, since analytic methods (classical analytic tools such as intermediate value theorem at the archimedean places and p-adic analysis at the nonarchimedean places) can be used. This implication does not hold, however, for more general types of equations. However, the idea of passing from local data to global ones proves fruitful in class field theory, for example, where local class field theory is used to obtain global insights mentioned above. This is also related to the fact that the Galois groups of the completions Fv can be explicitly determined, whereas the Galois groups of global fields, even of Q are far less understood.

Adeles and ideles

In order to assemble local data pertaining to all local fields attached to F, the adele ring is set up. A multiplicative variant is referred to as ideles.

See also

Notes

  1. Ireland, Kenneth; Rosen, Michael (1998), A Classical Introduction to Modern Number Theory, Berlin, New York: Springer-Verlag, ISBN 978-0-387-97329-6 , Ch. 1.4
  2. Bloch, Spencer; Kato, Kazuya (1990), "L-functions and Tamagawa numbers of motives", The Grothendieck Festschrift, Vol. I, Progr. Math., 86, Boston, MA: Birkhäuser Boston, pp. 333–400, MR1086888 
  3. Narkiewicz 2004, §2.2.6

References